Goto

Collaborating Authors

 domain-invariant projection learning


Domain-Invariant Projection Learning for Zero-Shot Recognition

Neural Information Processing Systems

Zero-shot learning (ZSL) aims to recognize unseen object classes without any training samples, which can be regarded as a form of transfer learning from seen classes to unseen ones. This is made possible by learning a projection between a feature space and a semantic space (e.g.


Reviews: Domain-Invariant Projection Learning for Zero-Shot Recognition

Neural Information Processing Systems

The authors present an algorithm for zero-shot learning based on learning a linear projection between the features of a pre-trained convnet and a semantic space in which to do nearest neighbors classification. Overall, I found the exposition of the paper very confusing, and I am still struggling to understand the exact set-up in the experiments and theorems after several re-readings. I'm not 100% certain from the paper what the authors are actually doing and what they have available as training data. In particular, the authors denote the test images by D_u, which they are sure to point out are unlabeled. They then define l_i {(u)} to be the label of test point x_i {(u)} - how can the label be included in a set of test images that is unlabeled?


Domain-Invariant Projection Learning for Zero-Shot Recognition

Zhao, An, Ding, Mingyu, Guan, Jiechao, Lu, Zhiwu, Xiang, Tao, Wen, Ji-Rong

Neural Information Processing Systems

Zero-shot learning (ZSL) aims to recognize unseen object classes without any training samples, which can be regarded as a form of transfer learning from seen classes to unseen ones. This is made possible by learning a projection between a feature space and a semantic space (e.g. Key to ZSL is thus to learn a projection function that is robust against the often large domain gap between the seen and unseen classes. In this paper, we propose a novel ZSL model termed domain-invariant projection learning (DIPL). Our model has two novel components: (1) A domain-invariant feature self-reconstruction task is introduced to the seen/unseen class data, resulting in a simple linear formulation that casts ZSL into a min-min optimization problem.